Prediction across sensory modalities: A neurocomputational model of the McGurk effect.
Abstract
The McGurk effect is a textbook illustration of the automaticity with which the human brain integrates audiovisual speech. It shows that even incongruent audiovisual (AV) speech stimuli can be combined into percepts that correspond neither to the auditory nor to the visual input, but to a mix of both. Typically, when presented with, e.g., visual /aga/ and acoustic /aba/, we perceive an illusory /ada/. In the inverse situation, however, when acoustic /aga/ is paired with visual /aba/, we perceive a combination of both stimuli, i.e., /abga/ or /agba/. Here we assessed the role of dynamic cross-modal predictions in the outcome of AV speech integration using a computational model that processes continuous audiovisual speech sensory inputs in a predictive coding framework. The model involves three processing levels: sensory units, units that encode the dynamics of stimuli, and multimodal recognition/identity units. The model exhibits dynamic predictive behavior because evidence about speech tokens can be asynchronous across sensory modalities, so the activity of the recognition units can be updated from one modality while top-down predictions are sent to the other. We explored the model's response to congruent and incongruent AV stimuli and found that, in the two-dimensional feature space spanned by the speech second formant and lip aperture, fusion stimuli are located in the neighborhood of congruent /ada/, which therefore provides a valid match. Conversely, stimuli that lead to combination percepts do not have a unique valid neighbor. In that case, acoustic and visual cues are both highly salient and generate conflicting predictions in the other modality that cannot be fused, forcing the elaboration of a combinatorial solution. We propose that dynamic predictive mechanisms play a decisive role in the dichotomous perception of incongruent audiovisual inputs.
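The geometric part of this argument (fusion stimuli fall near congruent /ada/, combination stimuli have no valid neighbor) can be illustrated with a toy nearest-neighbor check. This is only a static sketch, not the authors' dynamic predictive coding model: the prototype coordinates, the `MATCH_RADIUS` threshold, and the function names are all hypothetical values chosen for illustration, not figures from the paper.

```python
import math

# Hypothetical, normalized (second formant, lip aperture) prototypes for the
# congruent tokens. These numbers are made up for illustration only.
PROTOTYPES = {
    "aba": (0.2, 0.0),  # low F2, lips closed
    "ada": (0.5, 0.8),  # intermediate F2, lips open
    "aga": (0.9, 0.9),  # high F2, lips open
}

# Assumed maximum distance at which a congruent token counts as a valid match.
MATCH_RADIUS = 0.4


def make_stimulus(acoustic, visual):
    """Combine the F2 of the acoustic token with the lip aperture of the visual token."""
    return (PROTOTYPES[acoustic][0], PROTOTYPES[visual][1])


def nearest_valid_match(stimulus):
    """Return the closest congruent token, or None if none lies within MATCH_RADIUS."""
    best = min(PROTOTYPES, key=lambda token: math.dist(stimulus, PROTOTYPES[token]))
    if math.dist(stimulus, PROTOTYPES[best]) <= MATCH_RADIUS:
        return best
    return None


fusion = make_stimulus(acoustic="aba", visual="aga")       # McGurk fusion pairing
combination = make_stimulus(acoustic="aga", visual="aba")  # inverse pairing

print(nearest_valid_match(fusion))       # a single nearby congruent token
print(nearest_valid_match(combination))  # no congruent token is close enough
```

With these assumed coordinates, the fusion pairing lands close to the /ada/ prototype, while the inverse pairing is far from every prototype, which mirrors the abstract's distinction between fused and combination percepts.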
Related resources
Evidence for a Generic Process Underlying Multisensory Integration
It has been shown repeatedly that the various sensory modalities interact with each other and that the integration of incongruent percepts across two modalities, such as vision and audition, can lead to illusions. Different individual cognitive features (i.e., attention, linguistic experience, etc.) have been shown to modulate the level of multisensory integration. As such, it may be hypothesiz...
Cross-modal effects in statistical learning: Evidence from the McGurk illusion
Statistical learning is assumed to play a vital role in language acquisition, yet it is unknown whether it is guided by a unitary, modality-general mechanism, or by several sensory-specific mechanisms. Consistent with the latter view, Seitz et al. (2007) tested learners with multimodal input and found that statistical learning in one modality is independent of input to other modalities. We teste...
Integrating speech information across talkers, gender, and sensory modality: female faces and male voices in the McGurk effect.
Studies of the McGurk effect have shown that when discrepant phonetic information is delivered to the auditory and visual modalities, the information is combined into a new percept not originally presented to either modality. In typical experiments, the auditory and visual speech signals are generated by the same talker. The present experiment examined whether a discrepancy in the gender of the...
Running title: McGurk fusions across phonetic contexts
The McGurk effect has generally been studied within a limited range of phonetic contexts. With the goal of characterizing the McGurk effect through a wider range of contexts, a parametric investigation across three different vowel contexts, /i/, /a/, and /u/, and two different syllable types, consonant-vowel (CV) and vowel-consonant (VC), was conducted. This paper discusses context-dependent ch...
The Application of Illusions and Psychoacoustics to Small Loudspeaker Configurations
A brief overview of some auditory illusions is given, which serves merely as a ‘catalogue’ rather than a lengthy discussion. A topic related to auditory illusions is the interaction between different sensory modalities, e.g. sound and vision; a famous example is the McGurk effect (‘Hearing lips and seeing voices’) [1]. An auditory-visual overview is given in [2], a more general multisensory pro...
Journal title:
- Cortex; a journal devoted to the study of the nervous system and behavior
Volume 68 Issue
Pages -
Publication date 2015